Abstract


In this thesis, we study the downlink multiuser scheduling and power allocation problem 
for systems with simultaneous wireless information and power transfer (SWIPT). In the 
first part of the thesis, we focus on multiuser scheduling. We design optimal scheduling 
algorithms that maximize the long-term average system throughput under different 
fairness requirements, such as proportional fairness and equal throughput fairness. In 
particular, the algorithm designs are formulated as non-convex optimization problems 
which take into account the minimum required average sum harvested energy in the 
system. The problems are solved by using convex optimization techniques and the 
proposed optimization framework reveals the tradeoff between the long-term average 
system throughput and the sum harvested energy in multiuser systems with fairness 
constraints. Simulation results demonstrate that substantial performance gains can be 
achieved by the proposed optimization framework compared to existing suboptimal 
scheduling algorithms from the literature. In the second part of the thesis, we investigate 
the joint user scheduling and power allocation algorithm design for SWIPT systems. The 
algorithm design is formulated as a non-convex optimization problem which maximizes 
the achievable rate subject to a minimum required average power transfer. Subsequently,
the non-convex optimization problem is reformulated using the big-M method, and the
resulting problem is solved optimally. Furthermore, we show that joint power allocation and user scheduling
is an efficient way to enlarge the feasible trade-off region for improving the system 
performance in terms of achievable data rate and harvested energy. 
$P_n(i)$    Power allocated to user $n$ from the access point in the downlink in time slot $i$
$Q_{\mathrm{req}}$    Required amount of average sum harvested energy
$\gamma_n$    Lagrange multiplier which ensures proportional fairness
$\Gamma$    Step size for Lagrange multiplier $\gamma_n$ in the gradient algorithm
$\Theta$    Step size for Lagrange multiplier $\nu$ in the gradient algorithm
Chapter 1.

Introduction 

1.1. Energy Harvesting in Wireless Networks 

Over the past decades, battery-powered devices have been deployed in many wireless 
communication networks. However, since batteries have limited energy storage capacity 
and their replacement can be costly or even infeasible, harvesting energy from the 
environment provides a viable solution for prolonging the network lifetime. Although
conventional natural energy sources, such as solar and wind energy, are perpetual,
they are weather-dependent and location-dependent, which may make them unsuitable for mobile
communication devices. Alternatively, background radio frequency (RF) signals from
ambient transmitters are also an abundant source of energy for energy harvesting
(EH). Unlike natural energy sources, RF energy is weather-independent and can be
available on demand. Nowadays, EH circuits are able to harvest microwatts to milliwatts
of power over a range of several meters for a transmit power of 1 Watt and a carrier
frequency of less than 1 GHz [1]. Thus, RF energy can be a viable energy source for devices
with low power consumption, e.g., wireless sensors [2, 3]. Moreover, RF EH provides
the possibility for simultaneous wireless information and power transfer (SWIPT) since
RF signals carry both information and energy [4, 5].

Currently, there are two main research directions on RF EH communications. The first 
direction studies the resource allocation algorithm design for SWIPT. A fundamental 
tradeoff between information transfer rate and energy transfer rate was investigated in 
[6] under the assumption that the receiver is able to decode information and extract 
power from the same received signal. However, current practical circuits that harvest
energy from RF signals are not yet able to decode the carried information directly from
the same signal in general, i.e., the signal that is used for EH cannot be reused for
information decoding¹ [7]. Consequently, a power splitting receiver was proposed in [6]
and [7] for facilitating simultaneous information decoding (ID) and EH. This receiver
can be used for RF signals resulting from any modulation technique.

¹ We note that it is possible to decode the carried information directly from the same signal if non-coherent
modulation schemes are adopted, e.g., on-off keying, in which information is carried on the energy
level of the carrier signal.

A power splitting receiver [7]-[12] consists of a conventional energy receiver and
a conventional information receiver. In particular, it splits the received signal into
two power streams with power splitting ratios $1 - \rho(t)$ and $\rho(t)$ at time instant $t$, cf.
Figure 1.1, for harvesting energy and decoding the modulated information, respectively.
Specifically, the power splitting unit is installed in the analog front-end of the receiver
and is assumed to be a perfect passive analog device; it does not introduce any extra
power gain, i.e., $0 \le \rho(t) \le 1$, or noise to the received signal. In the extreme case, only
ID is performed when $\rho(t) = 1$ and only EH is done when $\rho(t) = 0$, i.e., the receiver
switches in time between the two modes. A receiver with $\rho(t) \in \{0, 1\}$ is referred to as
a time-switching receiver. Besides, power splitting receivers also generalize the case
of separated receivers in the literature [13]-[24].



Figure 1.1: Power splitting receiver. (Diagram labels: $n_A(t)$; $\sqrt{\rho(t)}\,y(t)$ at the input of the information receiver; $\sqrt{1 - \rho(t)}\,y(t)$ at the input of the energy receiver.)


Multiuser SWIPT systems have recently drawn significant attention in academia. In 
[12], an orthogonal frequency division multiple access (OFDMA) system with SWIPT 
was considered. The authors designed a scheduling algorithm for the maximization 
of the energy efficiency of data transmission (bits/Joule delivered to the receivers) 
for a minimum required sum rate and a minimum required energy harvested by the 
users. In [11] and [14], beamforming design was studied for power efficient and 
secure SWIPT networks with imperfect channel state information (CSI) and perfect 
CSI, respectively. In [24], beamformers were optimized for the maximization of the 
sum harvested energy under the minimum required signal-to-interference-plus-noise 
ratio constraints for multiple information receivers. Multiuser multiple input multiple 
output (MIMO) SWIPT systems were studied for the broadcast channel in [6] and for
the interference channel in [25]. Nevertheless, multiuser scheduling, which exploits
multiuser diversity for improving the system performance of multiuser systems, has
not been considered in [2]-[23]. Recently, simple suboptimal order-based schemes
were proposed to balance the tradeoff between the users' ergodic achievable rates and
their average amounts of harvested energy in [26]. However, the scheduling schemes
proposed in [26] are unable to guarantee quality of service (QoS) with respect to the
minimum energy transfer. In fact, optimal multiuser scheduling schemes that guarantee 
a long-term minimum harvested energy for SWIPT systems have not been considered in 
the literature so far. 

Another research direction focuses on wireless powered communication networks 
(WPCNs), where the wireless terminals in the network communicate using the energy 
harvested from wireless power transfer (WPT) as shown in Figure 1.2 [27]. 







Figure 1.2: A general wireless powered communication network (WPCN) [27]. 


In [28], the authors studied a wireless powered cellular network in which dedicated 
power-beacons were used in the cellular network to transfer wireless energy to mobile 
terminals. In [29], a wireless powered sensor network was investigated, where a mobile 
charging vehicle moving in the network was employed as the energy transmitter to 
wirelessly power sensor nodes. In [30], a network architecture enabling secondary users 
to harvest energy as well as reuse the spectrum of primary users in the cognitive radio 
network was proposed. In [31], the authors studied a typical WPCN model and proposed 
a harvest-then-transmit protocol, in which a hybrid access point (AP) coordinated WPT 
in the downlink (DL) and wireless information transfer (WIT) in the uplink (UL) for a set 
of distributed users. The authors proposed a protocol for sum-throughput maximization 
and enhanced it by considering a fair rate allocation among different users. In [27], 
this system was extended to the case when the AP has multiple antennas. To improve 
the performance in wireless communication systems with EH, user cooperation was 
suggested and studied in [32-34]. In [35], the full-duplex technique was applied to a
WPCN for further throughput improvement. Also, a new transmission protocol, enabling
efficient simultaneous DL WPT and UL WIT over the same bandwidth, was proposed.

1.2. Overview of the Thesis

In this thesis, we investigate a SWIPT system with one AP and multiple users. We
consider DL transmission where one user receives information in each time slot, while the
remaining users opportunistically harvest the ambient RF energy. We design optimal
scheduling algorithms that maximize the long-term average system throughput under
different fairness requirements, such as proportional fairness and equal throughput
fairness. We show that with fairness considerations, the feasible trade-off regions of
achievable data rate and harvested energy (R-E) for the proposed schemes shrink
due to the loss of degrees of freedom in resource allocation. Moreover, we study joint
user selection and power allocation for the considered SWIPT systems. We propose 
an optimal resource allocation algorithm and reveal the R-E region of this scheme. In 
particular, we show that joint power allocation and user scheduling is a more efficient 
way to enlarge the feasible trade-off region which improves the system performance in 
terms of achievable data rate and harvested energy. 

The remainder of the thesis is organized as follows. In Section 2.1, we introduce the 
SWIPT system model. Sections 2.2 - 2.4 investigate the maximum throughput scheduling, 
proportional fair scheduling, and equal throughput scheduling schemes, respectively. 
For each scheduling scheme, we design the optimal user selection policy and verify 
its performance via simulation by comparing it with a baseline scheduling scheme. In 
Chapter 3, we consider joint user selection and power allocation for SWIPT systems 
and provide an optimal resource allocation algorithm. Finally, Chapter 4 concludes this 
thesis. 





Chapter 2. 

Optimum Online Multiuser Scheduling 


In this chapter, we introduce our system model which consists of one AP and multiple 
users. We design optimal scheduling algorithms that maximize the long-term average 
system throughput under different fairness requirements, such as proportional fairness 
and equal throughput fairness. In particular, the algorithm designs are formulated 
as non-convex optimization problems which take into account the minimum required 
average sum harvested energy in the system. 

2.1. System Model 

We consider a SWIPT system that consists of one access point (AP) with a fixed power 
supply and N battery-powered user terminals (UTs), see Figure 2.1. The AP and the UTs 
are equipped with single antennas. Besides, we adopt time-switching receivers at the 
UTs [2] to ensure low hardware complexity. 



Figure 2.1: SWIPT system model. 
We study the user scheduling for DL transmission. We assume that the transmission is
divided into $T$ time slots and in each time slot perfect CSI is available at the AP. Also,
the data buffer for the users at the AP is always full such that enough data packets are
available for transmission for every scheduled UT. In each time slot, the AP schedules one
user for ID, while the remaining users opportunistically harvest energy¹ from the received
signal. We assume block fading channels. In particular, the channels remain constant
during a time slot and change independently over different time slots. Besides, the users
are physically separated from one another such that they experience independent fading.

Furthermore, we adopt the EH receiver model from [7]. The RF energy harvested by
user $n \in \{1, \ldots, N\}$ in time slot $i \in \{1, \ldots, T\}$ is given by

$$Q_n(i) = \xi_n P h_n(i), \tag{2.1}$$

where $P$ is the constant AP transmit power, $0 \le \xi_n \le 1$ is the RF-to-direct-current (DC)
conversion efficiency² of the EH receiver of user $n$, and $h_n(i)$ is the channel power gain
between the AP and user $n$ in time slot $i$.
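As a quick numerical illustration of (2.1), the following minimal sketch converts the AP transmit power from dBm to Watt and evaluates the harvested energy for one user and one unit-length time slot. The helper names `dbm_to_watt` and `harvested_energy` and the channel power gain of $10^{-6}$ are assumptions made for illustration only; the transmit power and conversion efficiency are the values used later in the simulations.

```python
def dbm_to_watt(p_dbm: float) -> float:
    """Convert a power level from dBm to Watt."""
    return 10.0 ** ((p_dbm - 30.0) / 10.0)


def harvested_energy(xi: float, p_watt: float, h: float) -> float:
    """Energy harvested in one unit-length slot, Q_n(i) = xi_n * P * h_n(i), cf. (2.1)."""
    if not 0.0 <= xi <= 1.0:
        raise ValueError("conversion efficiency must lie in [0, 1]")
    return xi * p_watt * h


P = dbm_to_watt(40.0)                            # 40 dBm corresponds to 10 W
Q = harvested_energy(xi=0.5, p_watt=P, h=1e-6)   # h = 1e-6 is an assumed channel gain
print(f"P = {P:.1f} W, Q = {Q:.2e} J per slot")
```

Since the time slot has unit length, the result can be read either as harvested power or as harvested energy.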

In the following, we propose three optimal multiuser scheduling schemes that control 
the R-E tradeoff under different fairness considerations. 


2.2. Optimal Maximum Throughput (MT) Scheme 

First, we consider a scheduling scheme which maximizes the average sum rate subject 
to a constraint on the minimum required average aggregate harvested energy. We 
note that this scheme aims to reveal the best system performance, and fairness in 
resource allocation for UTs is not considered. To facilitate the following presentation, 
we introduce the user selection variables $q_n(i)$, where $i \in \{1, 2, \ldots, T\}$ and $n \in \{1, \ldots, N\}$.
In time slot $i$, if user $n$ is scheduled to perform ID, $q_n(i) = 1$, whereas $q_{n'}(i) = 0$, $\forall n' \neq n$,
i.e., all the remaining idle users harvest energy from the transmitted signal. Now, we
formulate the MT optimization problem as follows.

¹ We consider a unit-length time slot, hence the terms "power" and "energy" can be used interchangeably.

² For currently available RF energy harvesters, the energy conversion efficiency can reach up to 0.7 [1].
Problem 1. Maximum Throughput Optimization:

$$
\begin{aligned}
\text{maximize} \quad & R_{\mathrm{sum}} \\
\text{subject to} \quad & \text{C1: } \sum_{n=1}^{N} q_n(i) = 1, \ \forall i, \\
& \text{C2: } q_n(i) \in \{0, 1\}, \ \forall n, i, \\
& \text{C3: } Q_{\mathrm{sum}} \ge Q_{\mathrm{req}},
\end{aligned}
\tag{2.2}
$$

where

$$R_{\mathrm{sum}} = \lim_{T \to \infty} \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i) C_n(i), \tag{2.3}$$

$$Q_{\mathrm{sum}} = \lim_{T \to \infty} \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} \left(1 - q_n(i)\right) Q_n(i), \quad \text{and} \tag{2.4}$$

$$C_n(i) = \log_2\left(1 + \frac{P h_n(i)}{\sigma^2}\right). \tag{2.5}$$

Here, $\sigma^2$ is the additive white Gaussian noise power at the UTs. In the considered problem,
we focus on the long-term system performance for $T \to \infty$. Constraints C1 and C2
ensure that in each time slot only one user is selected to receive information. C3 ensures
that the average amount of harvested energy $Q_{\mathrm{sum}}$ is no less than the minimum required
amount $Q_{\mathrm{req}}$. Since the user selection variables $q_n(i)$, $\forall n, i$, are binary, the problem in
(2.2) is non-convex. In order to handle the non-convexity, we adopt the time-sharing
relaxation. In particular, we relax the binary constraint C2 such that $q_n(i)$ is a continuous
value between zero and one. Then, the relaxed version of problem (2.2) can be written
in minimization form as:





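$$
\begin{aligned}
\text{minimize} \quad & -R_{\mathrm{sum}} \\
\text{subject to} \quad & \text{C1, C3,} \\
& \widetilde{\text{C2}}\text{: } 0 \le q_n(i) \le 1, \ \forall n, i.
\end{aligned}
\tag{2.6}
$$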
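To make the objective and the constraints of the relaxed problem concrete, the following minimal sketch evaluates $R_{\mathrm{sum}}$ and $Q_{\mathrm{sum}}$ for a given (binary or time-shared) schedule over a finite number of slots, i.e., without the limit $T \to \infty$. The function name `evaluate_schedule` and the toy numbers are assumptions made for illustration.

```python
import numpy as np


def evaluate_schedule(q, h, P, xi, sigma2):
    """Return (R_sum, Q_sum) as in (2.3) and (2.4), evaluated for a finite horizon T.

    q, h: arrays of shape (T, N); q[i, n] is the selection variable of user n in slot i.
    """
    q = np.asarray(q, dtype=float)
    h = np.asarray(h, dtype=float)
    # C1: the selection variables of each slot must sum to one.
    if not np.allclose(q.sum(axis=1), 1.0):
        raise ValueError("C1 violated: each row of q must sum to one")
    # Relaxed C2: every selection variable lies between zero and one.
    if np.any(q < 0.0) or np.any(q > 1.0):
        raise ValueError("relaxed C2 violated: q must lie in [0, 1]")
    C = np.log2(1.0 + P * h / sigma2)      # achievable rate, cf. (2.5)
    Q = xi * P * h                         # harvested energy, cf. (2.1)
    T = q.shape[0]
    R_sum = np.sum(q * C) / T
    Q_sum = np.sum((1.0 - q) * Q) / T
    return R_sum, Q_sum


h = [[1.0, 3.0], [3.0, 1.0]]               # assumed toy channel gains, T = N = 2
binary = [[0, 1], [1, 0]]                  # always schedule the stronger user
shared = [[0.5, 0.5], [0.5, 0.5]]          # time-sharing between both users
print(evaluate_schedule(binary, h, P=1.0, xi=0.5, sigma2=1.0))
print(evaluate_schedule(shared, h, P=1.0, xi=0.5, sigma2=1.0))
```

In this toy example the binary schedule favors the rate, whereas the time-shared schedule trades rate for harvested energy, which is the tradeoff that constraint C3 controls.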



Now, we introduce the following theorem that reveals the tightness of the binary
constraint relaxation.

Theorem 1. Problems (2.2) and (2.6) are equivalent³ with probability one when
$h_n(i)$, $\forall n, i$, are independent and continuously distributed. In particular, the constraint
relaxation of C2 is tight, i.e.,

$$\text{C2} \Leftrightarrow \widetilde{\text{C2}}\text{: } 0 \le q_n(i) \le 1, \ \forall n, i. \tag{2.7}$$

Proof. Theorem 1 will be proved in the following based on the optimal solution of
(2.6). □

The constraint-relaxed problem is convex with respect to the optimization variables
and satisfies Slater's constraint qualification; therefore, strong duality holds and the
duality gap is zero. Hence, the optimal value of the primal problem is equal to the
optimal value of the dual problem. We solve (2.6) via the dual problem to get insights
into the structure of the solution.

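To this end, we first define the Lagrangian function for the above optimization problem
which is given by:

$$
\begin{aligned}
L(q_n(i), \lambda(i), \alpha_n(i), \beta_n(i), \nu)
&= -R_{\mathrm{sum}} + \sum_{i=1}^{T} \lambda(i) \left( \sum_{n=1}^{N} q_n(i) - 1 \right)
 + \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) \left( q_n(i) - 1 \right) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \beta_n(i) q_n(i) + \nu \left( Q_{\mathrm{req}} - Q_{\mathrm{sum}} \right) \\
&= -\frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i) C_n(i) + \sum_{i=1}^{T} \lambda(i) \left( \sum_{n=1}^{N} q_n(i) - 1 \right)
 + \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) \left( q_n(i) - 1 \right) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \beta_n(i) q_n(i)
 + \nu \left( Q_{\mathrm{req}} - \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} \left( 1 - q_n(i) \right) Q_n(i) \right) \\
&= \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i) \left( -\frac{1}{T} C_n(i) + \lambda(i) + \alpha_n(i) - \beta_n(i) + \nu \frac{1}{T} Q_n(i) \right) - \sum_{i=1}^{T} \lambda(i) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) + \nu Q_{\mathrm{req}} - \nu \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} Q_n(i),
\end{aligned}
\tag{2.8}
$$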


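where $\lambda(i)$, $\beta_n(i)$, $\alpha_n(i)$, and $\nu$ are the Lagrange multipliers corresponding to constraint
C1, the lower bound $q_n(i) \ge 0$ and the upper bound $q_n(i) \le 1$ of the relaxed constraint C2 in (2.6),
and the energy constraint C3, respectively. Thus, the dual problem of (2.6) is given by

$$\underset{\alpha_n(i),\, \beta_n(i),\, \nu \ge 0,\ \lambda(i)}{\text{maximize}} \ \ \underset{q_n(i)}{\text{minimize}} \ \ L(q_n(i), \lambda(i), \alpha_n(i), \beta_n(i), \nu). \tag{2.9}$$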

In order to determine the optimal user selection policy, we apply standard convex 
optimization techniques via the examination of the Karush-Kuhn-Tucker (KKT) conditions 
which are summarized in the following: 

³ Here, "equivalent" means that both problems share the same optimal $q_n(i)$.
1. Stationarity condition: the derivative of the Lagrangian function with respect
to the primal variables $q_n(i)$, $\forall n, i$, is equal to zero at the optimum point, i.e.,

$$\frac{\partial L}{\partial q_n(i)} = 0, \ \forall i, n. \tag{2.10}$$

2. Primal feasibility condition: the optimal solution has to satisfy the constraints of
the primal problem.

3. Dual feasibility condition: the Lagrange multipliers for the inequality constraints
have to be non-negative, i.e.,

$$\alpha_n(i) \ge 0, \ \forall i, n, \tag{2.11a}$$

$$\beta_n(i) \ge 0, \ \forall i, n, \tag{2.11b}$$

$$\nu \ge 0. \tag{2.11c}$$

4. Complementary slackness: if an inequality is inactive, i.e., the optimal solution is
in the interior of the corresponding set, the corresponding Lagrange multiplier
is zero, i.e.,

$$\alpha_n(i) \left( q_n(i) - 1 \right) = 0, \ \forall i, n, \tag{2.12a}$$

$$\beta_n(i) q_n(i) = 0, \ \forall i, n, \tag{2.12b}$$

$$\nu \left( Q_{\mathrm{req}} - Q_{\mathrm{sum}} \right) = 0. \tag{2.12c}$$

In order to determine the optimal selection policy, we differentiate the Lagrangian in
(2.8) with respect to $q_n(i)$ and set it to zero:

$$\frac{\partial L}{\partial q_n(i)} = -\frac{1}{T} C_n(i) + \lambda(i) + \alpha_n(i) - \beta_n(i) + \frac{1}{T} \nu Q_n(i) = 0, \ \forall i, n. \tag{2.13}$$

We define $n^*$ as the index of the user which should be optimally selected in time slot $i$,
i.e., $q_{n^*}(i) = 1$. Then, the necessary conditions for $q_{n^*}(i) = 1$ are

$$q_n(i) = 0, \ \forall n \neq n^*, \tag{2.14a}$$

$$\alpha_n(i) = 0, \ \forall n \neq n^*, \tag{2.14b}$$

$$\beta_{n^*}(i) = 0, \tag{2.14c}$$

where (2.14a) follows from C1 in (2.6), (2.14b) follows from (2.12a), and (2.14c)
follows from (2.12b).
In the following, we substitute (2.14) into (2.13) and introduce the selection metric
$\Lambda_n(i)$ as

$$\Lambda_{n^*}(i) = T \left( \lambda(i) + \alpha_{n^*}(i) \right) = C_{n^*}(i) - \nu^* Q_{n^*}(i), \tag{2.15a}$$

$$\Lambda_n(i) = T \left( \lambda(i) - \beta_n(i) \right) = C_n(i) - \nu^* Q_n(i), \ \forall n \neq n^*. \tag{2.15b}$$

By subtracting (2.15b) from (2.15a), we obtain

$$\Lambda_{n^*}(i) - \Lambda_n(i) = T \left( \alpha_{n^*}(i) + \beta_n(i) \right). \tag{2.16}$$

From the dual feasibility conditions in (2.11), we know that $\alpha_n(i) \ge 0$ and $\beta_n(i) \ge 0$,
which yields

$$\alpha_{n^*}(i) + \beta_n(i) \ge 0, \tag{2.17a}$$

$$\Lambda_{n^*}(i) \ge \Lambda_n(i), \ \forall n \neq n^*. \tag{2.17b}$$

We note that the probability that $\Lambda_{n^*}(i) = \Lambda_n(i)$ is zero $\forall n \neq n^*$, $\forall i$, since $\Lambda_n(i)$, $\forall n$, are
continuous random variables. Thus, the selection criterion for the MT scheme reduces
to

$$\Lambda_{n^*}(i) = \max_{n \in \{1, \ldots, N\}} \left\{ C_n(i) - \nu^* Q_n(i) \right\}, \tag{2.18}$$

where the Lagrange multiplier $\nu^*$ is chosen such that the constraint on the harvested
energy, C3, is satisfied. $\nu^*$ works as a threshold which depends only on the long-term
statistics of the channels. Thus, it can be calculated offline and used for online multiuser
scheduling as long as the channel statistics remain unchanged. We note that although
the original problem in (2.2) considers an infinite number of time slots and long-term
averages for the sum rate and the total harvested energy, interestingly, the optimal
scheduling rule in (2.18) depends only on the current time slot, i.e., online scheduling is
optimal. Besides, the solution of the relaxed problem is itself of the Boolean type since
$q_n(i) \in \{0, 1\}$. Therefore, the adopted binary relaxation is tight.

On the other hand, the optimal value of $\nu^*$ can be obtained iteratively via the gradient
method which is described in Algorithm 1, where $m$ is the iteration index and $\Theta$ is an
appropriately chosen step size.

Algorithm 1 Gradient algorithm for $\nu^*$

initialize the iteration index $m = 0$ and dual variable $\nu[0]$
repeat

1. Generate a sufficiently large number of channel realizations, compute the
metric in (2.18) and make a selection of user $n^*$.

2. Compute the average harvested energy $Q_{\mathrm{sum}}$.

3. Update $\nu[m + 1] = \max\{\nu[m] + \nabla_\nu \Theta, 0\}$, where $\nabla_\nu = Q_{\mathrm{req}} - Q_{\mathrm{sum}}$.
until convergence to $\nu^*$.
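The steps of Algorithm 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the simulation code behind the results reported here: the parameters are normalized (unit transmit power, unit-mean exponential channel power gains), the same set of channel realizations is reused in every iteration, and the names `mt_schedule`, `average_performance`, and `gradient_nu` as well as the step size, tolerance, and $Q_{\mathrm{req}}$ value are hypothetical choices.

```python
import numpy as np


def mt_schedule(C, Q, nu):
    """MT rule (2.18): in every slot select the user maximizing C_n(i) - nu * Q_n(i)."""
    return np.argmax(C - nu * Q, axis=1)


def average_performance(C, Q, selected):
    """Average sum rate and average sum harvested energy for the given selections."""
    slots = np.arange(C.shape[0])
    R_sum = C[slots, selected].mean()
    # Every user except the scheduled one harvests energy in a slot.
    Q_sum = (Q.sum(axis=1) - Q[slots, selected]).mean()
    return R_sum, Q_sum


def gradient_nu(C, Q, Q_req, step, nu0=0.0, max_iter=2000, tol=1e-4):
    """Projected gradient iteration of Algorithm 1 for the multiplier nu."""
    nu = nu0
    for _ in range(max_iter):
        selected = mt_schedule(C, Q, nu)                  # step 1
        _, Q_sum = average_performance(C, Q, selected)    # step 2
        nu_next = max(nu + step * (Q_req - Q_sum), 0.0)   # step 3
        if abs(nu_next - nu) < tol:
            return nu_next
        nu = nu_next
    return nu


rng = np.random.default_rng(0)
T, N = 20000, 4                               # assumed numbers of realizations and users
P, xi, sigma2 = 1.0, 0.5, 0.1                 # assumed normalized parameters
h = rng.exponential(scale=1.0, size=(T, N))   # Rayleigh fading: exponential power gain
C = np.log2(1.0 + P * h / sigma2)
Q = xi * P * h

nu_star = gradient_nu(C, Q, Q_req=1.6, step=1.0)
R_sum, Q_sum = average_performance(C, Q, mt_schedule(C, Q, nu_star))
print(f"nu* = {nu_star:.3f}, R_sum = {R_sum:.3f}, Q_sum = {Q_sum:.3f}")
```

Reusing one large batch of realizations keeps the iteration deterministic; drawing fresh realizations in every iteration, as written in step 1, would add sampling noise to the gradient but follows the same update.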


From (2.18), we have the following observations: 

• If the minimum required harvested energy is not stringent and is always satisfied,
e.g., $Q_{\mathrm{req}} = 0$, then the problem formulation is equivalent to the conventional
maximum sum-rate scheduling scheme. In this case, $\nu^* = 0$ and therefore the second
term of the selection metric in (2.18) vanishes.

• For fixed long-term channel statistics, a more stringent minimum required power
transfer $Q_{\mathrm{req}}$ leads to a larger value of $\nu^*$.


Simulation Results 

Now, we perform simulations for the designed optimal resource allocation scheme. The
important simulation parameters are summarized in Table 2.1. We adopt the path loss
model from [36] and the UTs are randomly and uniformly distributed between the
reference distance and the maximum service distance.

We use the order-based SNR scheduler from [26] as a baseline scheme. This scheduler
performs user selection according to the following rule:

$$n^*(i) = \arg \operatorname{order}\; h_n(i), \tag{2.19}$$

where $\arg \operatorname{order}$ is defined as the argument of a certain selection order $j \in \{1, \ldots, N\}$.
That is, the user whose channel power gain $h_n(i)$ has order $j$ is scheduled for ID.
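A minimal sketch of the order-based rule (2.19) is given below. The function name `order_based_select` is hypothetical, and the sketch assumes ascending orders, i.e., $j = 1$ refers to the weakest and $j = N$ to the strongest channel; the convention used in [26] may differ.

```python
import numpy as np


def order_based_select(h, j):
    """Per slot, return the index of the user whose channel gain has order j.

    Assumption: orders are ascending, so j = 1 is the weakest and j = N the strongest user.
    """
    h = np.asarray(h, dtype=float)
    N = h.shape[1]
    if not 1 <= j <= N:
        raise ValueError("order j must lie in {1, ..., N}")
    # argsort lists, for every slot, the user indices sorted by ascending gain.
    return np.argsort(h, axis=1)[:, j - 1]


h = np.array([[0.2, 1.5, 0.7],
              [2.0, 0.1, 0.9]])          # assumed toy gains, 2 slots and 3 users
print(order_based_select(h, j=3))        # strongest user in every slot
print(order_based_select(h, j=1))        # weakest user in every slot
```

Because $j$ takes only $N$ values, such a scheduler yields $N$ distinct operating points, one per order.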

Figure 2.2 shows the average sum rate (bits/(channel use)) versus the average sum
harvested energy (Watts) of the MT schemes for different numbers of users. We note that
the suboptimal order-based scheme can only achieve discrete points on the R-E curves,
corresponding to the selection orders $j \in \{1, \ldots, N\}$. On the contrary, the proposed
optimal MT scheduling scheme can achieve any feasible point on the R-E curve, which
provides a higher flexibility for the system designer to strike a balance between average
sum rate and average harvested energy. Besides, as expected, the average system sum
rate increases with the number of UTs as the proposed scheme is able to exploit multiuser
diversity. Furthermore, the average sum harvested energy also increases with the number
of UTs since more idle users participate in EH in any given time slot.

Table 2.1: Simulation parameters.

Parameter                                   Value
AP transmit power $P$                       40 dBm
Noise power $\sigma^2$                      -62 dBm
RF-to-DC conversion efficiency $\xi_n$      0.5
Path loss exponent                          3.6
Maximum service distance                    100 m
Reference distance                          2 m
Antenna gain of AP and UTs                  10 dBi and 2 dBi
Carrier center frequency                    915 MHz
Bandwidth                                   200 kHz
Fading channel                              Rayleigh

Figure 2.2: Average sum rate versus average sum harvested energy of the MT schemes
for different numbers of UTs.
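The R-E tradeoff of the MT scheme can also be traced numerically by sweeping the multiplier $\nu$ in the selection rule (2.18) instead of fixing $Q_{\mathrm{req}}$. The following minimal sketch does this for normalized parameters; the function name `re_curve`, the parameter values, and the grid of $\nu$ values are assumptions for illustration and do not reproduce the simulation setup of Table 2.1.

```python
import numpy as np


def re_curve(C, Q, nus):
    """Return an array of (Q_sum, R_sum) pairs, one per value of nu, using rule (2.18)."""
    slots = np.arange(C.shape[0])
    points = []
    for nu in nus:
        sel = np.argmax(C - nu * Q, axis=1)
        R_sum = C[slots, sel].mean()
        Q_sum = (Q.sum(axis=1) - Q[slots, sel]).mean()
        points.append((Q_sum, R_sum))
    return np.array(points)


rng = np.random.default_rng(1)
T, N = 5000, 4                              # assumed numbers of realizations and users
P, xi, sigma2 = 1.0, 0.5, 0.1               # assumed normalized parameters
h = rng.exponential(1.0, size=(T, N))       # Rayleigh fading: exponential power gain
C = np.log2(1.0 + P * h / sigma2)
Q = xi * P * h

curve = re_curve(C, Q, nus=np.linspace(0.0, 30.0, 7))
for Q_sum, R_sum in curve:
    print(f"Q_sum = {Q_sum:.3f}, R_sum = {R_sum:.3f}")
```

For a fixed set of realizations, increasing $\nu$ can only increase the average harvested energy and decrease the average sum rate, so the printed points move along the R-E boundary from the maximum sum-rate point toward the maximum harvested-energy point.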
2.3. Optimum Proportional Fair (PF) Scheme

In the MT scheme, UTs with weak channel conditions may be deprived from gaining 
access to the channel which leads to user starvation. In order to strike a balance between 
system throughput and fairness, we introduce proportional fairness into our scheduler, 
which aims to provide each UT with a performance proportional to its channel conditions. 
This is achieved by allowing all UTs to access the channel with equal chances. In this 
case, the optimization problem with the relaxed binary constraint on the user selection 
variables is formulated as: 



where constraint C4 specifies that each user has to access the channel for $T/N$
time slots, $T$ is the total number of time slots, and $N$ is the number of users in the
system.

Since the problem is non-convex due to the binary nature of $q_n(i)$, $\forall n, i$, we relax the binary constraint
to obtain a convex problem. We note that the adopted constraint relaxation is tight and
the proof is given in Appendix A.1. Thus, we obtain the following reformulated problem
in minimization form:
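The Lagrangian function for the problem (2.21) is given by:

$$
\begin{aligned}
L(q_n(i), \lambda(i), \alpha_n(i), \beta_n(i), \nu, \gamma_n)
&= -R_{\mathrm{sum}} + \sum_{i=1}^{T} \lambda(i) \left( \sum_{n=1}^{N} q_n(i) - 1 \right)
 + \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) \left( q_n(i) - 1 \right) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \beta_n(i) q_n(i) + \nu \left( Q_{\mathrm{req}} - Q_{\mathrm{sum}} \right)
 + \sum_{n=1}^{N} \gamma_n \left( \frac{1}{T} \sum_{i=1}^{T} q_n(i) - \frac{1}{N} \right) \\
&= -\frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i) C_n(i) + \sum_{i=1}^{T} \lambda(i) \left( \sum_{n=1}^{N} q_n(i) - 1 \right)
 + \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) \left( q_n(i) - 1 \right) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \beta_n(i) q_n(i)
 + \nu \left( Q_{\mathrm{req}} - \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} \left( 1 - q_n(i) \right) Q_n(i) \right)
 + \sum_{n=1}^{N} \gamma_n \left( \frac{1}{T} \sum_{i=1}^{T} q_n(i) - \frac{1}{N} \right) \\
&= \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i) \left( -\frac{1}{T} C_n(i) + \lambda(i) + \alpha_n(i) - \beta_n(i) + \nu \frac{1}{T} Q_n(i) + \frac{1}{T} \gamma_n \right) - \sum_{i=1}^{T} \lambda(i) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) + \nu Q_{\mathrm{req}} - \nu \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} Q_n(i) - \sum_{n=1}^{N} \gamma_n \frac{1}{N},
\end{aligned}
\tag{2.22}
$$

where $\lambda(i)$, $\beta_n(i)$, $\alpha_n(i)$, $\nu$, and $\gamma_n$ are the Lagrange multipliers corresponding to
constraints C1, C2, C3, C4, and C5, respectively.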

We study the structure of the optimal scheduling policy via the KKT conditions. The
stationarity condition is then given by

$$\frac{\partial L}{\partial q_n(i)} = -\frac{1}{T} C_n(i) + \lambda(i) + \alpha_n(i) - \beta_n(i) + \frac{1}{T} \nu Q_n(i) + \frac{1}{T} \gamma_n = 0. \tag{2.23}$$

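Analogously to the MT scheme, we introduce the selection metric $\Lambda_n(i)$ as

$$\Lambda_{n^*}(i) = T \left( \lambda(i) + \alpha_{n^*}(i) \right) = C_{n^*}(i) - \nu^* Q_{n^*}(i) - \gamma_{n^*}^*, \tag{2.24a}$$

$$\Lambda_n(i) = T \left( \lambda(i) - \beta_n(i) \right) = C_n(i) - \nu^* Q_n(i) - \gamma_n^*, \ \forall n \neq n^*, \tag{2.24b}$$

where $n^*$ is the user that is optimally scheduled for information reception. Subtracting
(2.24b) from (2.24a), we obtain

$$\Lambda_{n^*}(i) - \Lambda_n(i) = T \left( \alpha_{n^*}(i) + \beta_n(i) \right). \tag{2.25}$$

From the dual feasibility conditions, it follows that $\alpha_n(i) \ge 0$ and $\beta_n(i) \ge 0$; therefore,

$$\alpha_{n^*}(i) + \beta_n(i) \ge 0, \tag{2.26a}$$

$$\Lambda_{n^*}(i) \ge \Lambda_n(i), \ \forall n \neq n^*. \tag{2.26b}$$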



We note that the probability that $\Lambda_{n^*}(i) = \Lambda_n(i)$ is zero, since $\Lambda_n(i)$ are continuous
random variables. Hence, the selection criterion for the PF scheme reduces to

$$\Lambda_{n^*}(i) = \max_{n \in \{1, \ldots, N\}} \left\{ C_n(i) - \nu^* Q_n(i) - \gamma_n^* \right\}, \tag{2.27}$$

where the Lagrange multiplier $\nu^*$ for constraint C4 ensures that at least $Q_{\mathrm{req}}$ amount of
energy is harvested. The Lagrange multipliers $\gamma_n^*$ for constraint C5 guarantee that each user
accesses the channel an equal number of times. Analogously to the MT scheme, $\nu^*$ and $\gamma_n^*$
depend only on the long-term statistics of the channels; therefore, they can be calculated
offline and used for online multiuser scheduling as long as the channel statistics remain
unchanged. Besides, the optimal scheduling interestingly depends only on the current
time slot, i.e., online scheduling is optimal. We note that the optimal PF scheduling
rule is similar to the MT scheduling rule in (2.18), but the PF selection metric in (2.27)
contains an additional term $\gamma_n^*$ that provides proportional fairness.

The optimal values of $\nu^*$ and $\gamma_n^*$ can be obtained iteratively via the gradient method
as described in Algorithm 2, where $m$ is the iteration index. Variables $\Theta$ and $\Gamma$ are
appropriately chosen step sizes to facilitate the convergence of the gradient method.

Algorithm 2 Gradient algorithm for $\nu^*$ and $\gamma_n^*$

initialize the iteration index $m = 0$, $\nu[0]$, and $\gamma_n[0]$
repeat

1. Generate a sufficiently large number of channel realizations, compute the
metric in (2.27) and make a selection of user $n^*$.

2. Compute the average harvested energy $Q_{\mathrm{sum}}$ and the number of selections of
each user.

3. Update $\nu[m + 1] = \max\{\nu[m] + \nabla_\nu \Theta, 0\}$, where $\nabla_\nu = Q_{\mathrm{req}} - Q_{\mathrm{sum}}$,
update $\gamma_n[m + 1] = \gamma_n[m] + \nabla_\gamma \Gamma$, where $\nabla_\gamma = \frac{1}{T} \sum_{i=1}^{T} q_n(i) - \frac{1}{N}$.
until convergence to $\nu^*$ and $\gamma_n^*$.
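Algorithm 2 can be sketched along the same lines as the MT case. The following Python fragment is a minimal illustration under stated assumptions: normalized parameters, users with unequal mean channel power gains, one fixed batch of channel realizations, a fixed number of iterations instead of a convergence test, and hypothetical names (`pf_schedule`, `gradient_pf`) and step sizes.

```python
import numpy as np


def pf_schedule(C, Q, nu, gamma):
    """PF rule (2.27): select the user maximizing C_n(i) - nu * Q_n(i) - gamma_n."""
    return np.argmax(C - nu * Q - gamma[None, :], axis=1)


def gradient_pf(C, Q, Q_req, step_nu, step_gamma, n_iter=2000):
    """Gradient iteration of Algorithm 2 for nu and gamma_n (fixed iteration count)."""
    T, N = C.shape
    slots = np.arange(T)
    nu, gamma = 0.0, np.zeros(N)
    for _ in range(n_iter):
        sel = pf_schedule(C, Q, nu, gamma)                       # step 1
        Q_sum = (Q.sum(axis=1) - Q[slots, sel]).mean()           # step 2
        access = np.bincount(sel, minlength=N) / T               # (1/T) * sum_i q_n(i)
        nu = max(nu + step_nu * (Q_req - Q_sum), 0.0)            # step 3, projected to nu >= 0
        gamma = gamma + step_gamma * (access - 1.0 / N)          # step 3, no projection
    return nu, gamma


rng = np.random.default_rng(2)
T, N = 10000, 3                               # assumed numbers of realizations and users
P, xi, sigma2 = 1.0, 0.5, 0.1                 # assumed normalized parameters
means = np.array([0.5, 1.0, 2.0])             # assumed unequal mean channel power gains
h = rng.exponential(scale=means, size=(T, N))
C = np.log2(1.0 + P * h / sigma2)
Q = xi * P * h

nu_star, gamma_star = gradient_pf(C, Q, Q_req=1.0, step_nu=0.5, step_gamma=0.5)
sel = pf_schedule(C, Q, nu_star, gamma_star)
access = np.bincount(sel, minlength=N) / T
Q_sum = (Q.sum(axis=1) - Q[np.arange(T), sel]).mean()
print("access fractions:", np.round(access, 3), "Q_sum:", round(float(Q_sum), 3))
```

Adding the same constant to all $\gamma_n$ leaves the selection in (2.27) unchanged, so only the differences between the $\gamma_n$ matter; in this sketch their sum stays at its initial value because the access fractions always sum to one.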


Simulation Results 

Next, we perform simulations for the scheme investigated in this section. The simulation
parameters are the same as in Section 2.2. As the baseline scheme, we
use the order-based normalized-SNR (N-SNR) scheduler from [26], which performs user
selection according to the following rule:

$$n^*(i) = \arg \operatorname{order}\; \frac{h_n(i)}{\Omega_n}, \tag{2.28}$$

where $\Omega_n$ denotes the mean channel power gain of UT $n$. This scheduling rule also
ensures proportional fairness, i.e., all users gain access to the channel an equal number of times.

Figure 2.3: Average sum rate versus average sum harvested energy of the PF schemes
for different numbers of UTs.
Figure 2.3 depicts the average sum rate (bits/(channel use)) versus the average sum
harvested energy (Watts) for the PF schemes. It can be seen that the feasible R-E region of all
schemes shrinks compared to the MT scheduler in Figure 2.2. This is because the PF
scheduler takes fairness into account in the resource allocation and, as a result, cannot 
fully exploit the multiuser diversity for improving the average system sum rate. On the 
other hand, it can be seen that our proposed optimal schemes provide a substantial 
average sum rate gain compared to the corresponding suboptimal order-based scheme, 
especially for a high amount of average harvested energy in the system. In fact, the 
proposed optimization framework provides more degrees of freedom across different 
time slots in resource allocation compared to the suboptimal scheduling schemes. This 
allows the system to exploit the multiuser diversity to some extent for resource allocation 
even if fairness is taken into consideration. 


















2.4. Optimum Equal Throughput (ET) Scheme

Although the PF scheduler enables equal channel access probability for all UTs, it does not
provide any guaranteed minimum data rate to them. In contrast, the ET criterion is
fairer from the users' perspective compared to the PF criterion, as all the UTs achieve
the same average throughput asymptotically for $T \to \infty$. Therefore, in this section,
we design a scheduler which achieves ET fairness. Thus, the objective is to maximize
the minimum average achievable rate among all the UTs, i.e., maximize $\min_n \bar{C}_n$, where
$\bar{C}_n = \lim_{T \to \infty} \frac{1}{T} \sum_{i=1}^{T} q_n(i) C_n(i)$. The optimization problem is then formulated as follows:


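Problem 3. Optimal Equal Throughput Optimization:

$$
\begin{aligned}
\underset{q_n(i),\, \forall i, n}{\text{maximize}} \quad & \min_n \bar{C}_n \\
\text{subject to} \quad & \text{C1: } \sum_{n=1}^{N} q_n(i) = 1, \ \forall i, \\
& \text{C2: } q_n(i) \left( 1 - q_n(i) \right) = 0, \ \forall n, i, \\
& \text{C3: } Q_{\mathrm{sum}} \ge Q_{\mathrm{req}}.
\end{aligned}
\tag{2.29}
$$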

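Equation (2.29) is a max-min optimization problem, which can be rewritten in its
equivalent hypograph form:

$$
\begin{aligned}
\underset{r,\, q_n(i),\, \forall i, n}{\text{maximize}} \quad & r \\
\text{subject to} \quad & \text{C1: } \bar{C}_n \ge r, \ \forall n, \\
& \text{C2: } \sum_{n=1}^{N} q_n(i) = 1, \ \forall i, \\
& \text{C3: } q_n(i) \left( 1 - q_n(i) \right) = 0, \ \forall n, i, \\
& \text{C4: } Q_{\mathrm{sum}} \ge Q_{\mathrm{req}}.
\end{aligned}
\tag{2.30}
$$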
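Here, $r$ is an auxiliary optimization variable for handling the max-min objective function.
After binary relaxation of $q_n(i)$, $\forall n, i$, we formulate our convex optimization problem in
minimization form as follows:

$$
\begin{aligned}
\underset{r,\, q_n(i),\, \forall i, n}{\text{minimize}} \quad & -r \\
\text{subject to} \quad & \text{C1, C2,} \\
& \text{C3: } -q_n(i) \le 0, \ \forall n, i, \\
& \text{C4: } q_n(i) - 1 \le 0, \ \forall n, i, \\
& \text{C5: } Q_{\mathrm{sum}} \ge Q_{\mathrm{req}}.
\end{aligned}
\tag{2.31}
$$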
We note that the adopted binary constraint relaxation is tight; the proof is given in
Appendix A.1.


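The Lagrangian function for problem (2.31) is then given by

$$
\begin{aligned}
\mathcal{L}\bigl(q_n(i), \lambda(i), \alpha_n(i), \beta_n(i), \nu, \theta_n\bigr)
&= -r + \sum_{i=1}^{T} \lambda(i) \Bigl( \sum_{n=1}^{N} q_n(i) - 1 \Bigr)
 + \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i) \bigl( q_n(i) - 1 \bigr) \\
&\quad - \sum_{i=1}^{T} \sum_{n=1}^{N} \beta_n(i)\, q_n(i)
 + \nu \bigl( Q_{\text{req}} - \bar{Q}_{\text{sum}} \bigr)
 + \sum_{n=1}^{N} \theta_n \bigl( r - \bar{C}_n \bigr) \\
&= -r + \sum_{i=1}^{T} \lambda(i) \sum_{n=1}^{N} q_n(i) - \sum_{i=1}^{T} \lambda(i)
 + \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i)\, q_n(i)
 - \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i)
 - \sum_{i=1}^{T} \sum_{n=1}^{N} \beta_n(i)\, q_n(i) \\
&\quad + \nu Q_{\text{req}}
 - \nu \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} Q_n(i)
 + \nu \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i)\, Q_n(i)
 + \sum_{n=1}^{N} \theta_n r
 - \sum_{n=1}^{N} \theta_n \frac{1}{T} \sum_{i=1}^{T} C_n(i)\, q_n(i) \\
&= -r + \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i) \Bigl[ \lambda(i) + \alpha_n(i) - \beta_n(i)
 + \nu \frac{1}{T} Q_n(i) - \theta_n \frac{1}{T} C_n(i) \Bigr] \\
&\quad - \sum_{i=1}^{T} \lambda(i) - \sum_{i=1}^{T} \sum_{n=1}^{N} \alpha_n(i)
 + \nu Q_{\text{req}} - \nu \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} Q_n(i)
 + \sum_{n=1}^{N} \theta_n r,
\end{aligned}
\tag{2.32}
$$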


where $\theta_n$, $\lambda(i)$, $\beta_n(i)$, $\alpha_n(i)$, and $\nu$ are the Lagrange multipliers
corresponding to constraints C1, C2, C3, C4, and C5, respectively.


We study the structure of the optimal scheduling policy via the KKT conditions. The 
stationarity condition is expressed as 

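$$
\frac{\partial \mathcal{L}}{\partial q_n(i)} = \lambda(i) + \alpha_n(i) - \beta_n(i)
 + \nu \frac{1}{T} Q_n(i) - \theta_n \frac{1}{T} C_n(i) = 0,
\tag{2.33a}
$$

$$
\frac{\partial \mathcal{L}}{\partial r} = -1 + \sum_{n=1}^{N} \theta_n = 0
\ \Rightarrow\ \sum_{n=1}^{N} \theta_n = 1.
\tag{2.33b}
$$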


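Next, we introduce the selection metric $\Lambda_n(i)$ for our ET scheduler. By complementary
slackness, $\beta_{n^*}(i) = 0$ for the scheduled user ($q_{n^*}(i) = 1$) and $\alpha_n(i) = 0$
for all other users ($q_n(i) = 0$), so that the stationarity condition (2.33a) yields

$$
\Lambda_{n^*}(i) = T \bigl( \lambda(i) + \alpha_{n^*}(i) \bigr)
 = \theta_{n^*}^* C_{n^*}(i) - \nu^* Q_{n^*}(i),
\tag{2.34a}
$$

$$
\Lambda_n(i) = T \bigl( \lambda(i) - \beta_n(i) \bigr)
 = \theta_n^* C_n(i) - \nu^* Q_n(i), \quad \forall n \ne n^*,
\tag{2.34b}
$$

where $n^*$ is the optimally scheduled user index.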

Subtracting (2.34b) from (2.34a), we get

$$
\Lambda_{n^*}(i) - \Lambda_n(i) = T \bigl( \alpha_{n^*}(i) + \beta_n(i) \bigr).
\tag{2.35}
$$

From the dual feasibility conditions, we know that $\alpha_n(i) \ge 0$ and $\beta_n(i) \ge 0$;
therefore,

$$
\alpha_{n^*}(i) + \beta_n(i) \ge 0,
\tag{2.36a}
$$

$$
\Lambda_{n^*}(i) - \Lambda_n(i) \ge 0, \quad \forall n \ne n^*.
\tag{2.36b}
$$


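Similar to the MT and PF schemes, $\Lambda_n(i)$, $\forall n$, are continuous random variables.
Hence, the probability that $\Lambda_{n_1}(i) = \Lambda_{n_2}(i)$ for $n_1 \ne n_2$ is zero. Thus,
we obtain the following selection criterion for the ET scheme:

$$
q_n^*(i) =
\begin{cases}
1 & \text{if } \theta_n^* C_n(i) - \nu^* Q_n(i)
 = \max\limits_{k \in \{1, \dots, N\}} \bigl( \theta_k^* C_k(i) - \nu^* Q_k(i) \bigr), \\
0 & \text{otherwise},
\end{cases}
\tag{2.37}
$$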


where the Lagrange multiplier $\nu^*$ ensures that constraint C5 on the minimum required
harvested energy is satisfied, and the Lagrange multipliers $\theta_n^*$ ensure that all users
achieve ET. Analogous to the previously described schemes, $\nu^*$ and $\theta_n^*$ depend only
on the long-term statistics of the channels. Therefore, they can be calculated offline and used
for online multiuser scheduling as long as the channel statistics remain unchanged.
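As a minimal sketch of how the ET selection metric $\theta_n^* C_n(i) - \nu^* Q_n(i)$ from (2.34) is applied in a single time slot, the Python function below picks the user with the largest metric. The function name `et_select` and the numerical values are hypothetical; the multipliers are simply passed in, as they would be after an offline computation.

```python
def et_select(theta, nu, C_i, Q_i):
    """Return (n_star, metrics) for one time slot.

    theta[n] and nu are the dual variables, C_i[n] is the instantaneous
    achievable rate and Q_i[n] the harvestable energy of user n.
    """
    metrics = [t * c - nu * e for t, c, e in zip(theta, C_i, Q_i)]
    # Ties have probability zero for continuous channel gains; max() would
    # return the first maximizer if one occurred numerically.
    n_star = max(range(len(metrics)), key=metrics.__getitem__)
    return n_star, metrics


# With nu = 0 the user with the larger weighted rate is scheduled.
print(et_select([0.5, 0.5], 0.0, [2.0, 4.0], [1.0, 8.0]))
# A positive nu penalizes scheduling the user who could harvest a lot of energy.
print(et_select([0.5, 0.5], 0.25, [2.0, 4.0], [1.0, 8.0]))
```

In the second call, user 1 has the better rate but also the larger harvestable energy, so the energy price $\nu$ shifts the decision to user 0 and leaves user 1 free to harvest.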

The optimal values of $\nu^*$ and $\theta_n^*$ can be obtained iteratively via the gradient method
described in Algorithm 3, where $m$ is the iteration index and the appropriately chosen
step sizes $\phi$ and $\xi$ guarantee convergence of $\nu$ and $\theta_n$ to the optimal dual
variables $\nu^*$ and $\theta_n^*$, respectively. We note that $\theta_n^* \in [0, 1]$. This is because
$\theta_n \ge 0$ from the dual feasibility condition of the inequality constraint C1 and
$\theta_n \le 1$ since $\sum_{n=1}^{N} \theta_n = 1$ from (2.33b).

Algorithm 3 Gradient algorithm for $\nu^*$ and $\theta_n^*$

initialize $m = 0$, $\nu[0]$ and $\theta_n[0]$

repeat

1. Generate a sufficiently large number of channel realizations, compute the
metric in (2.37) and select user $n^*$.

2. Compute the average total harvested energy $\bar{Q}_{\text{sum}}$ and the average rate per
user $\bar{C}_n = \frac{1}{T} \sum_{i=1}^{T} C_n(i) q_n(i)$.

3. Update $\nu[m+1] = \max\{\nu[m] + \nabla_\nu \phi,\ 0\}$, where $\nabla_\nu = Q_{\text{req}} - \bar{Q}_{\text{sum}}$;

update $\theta_n[m+1] = \bigl[\theta_n[m] + \nabla_\theta \xi\bigr]_0^1$, where $\nabla_\theta = r - \bar{C}_n$.

until convergence to $\nu^*$ and $\theta_n^*$.
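The iteration in Algorithm 3 can be sketched in Python as follows. This is an illustration under several assumptions that are not taken from the thesis: the channel power gains are drawn from an exponential distribution, the rate is $\log_2(1 + P h / \sigma^2)$, the harvested energy is proportional to $P h$, the value of $r$ in the update is taken as the current minimum average rate, and $\theta_n$ is renormalized after clipping so that it sums to one as required by (2.33b). All function names, parameter names, and default values are hypothetical.

```python
import math
import random


def simulate(theta, nu, mean_gain, T, rng, power=1.0, noise=1.0, eff=0.5):
    """Steps 1 and 2: run T slots of the metric-based scheduler."""
    N = len(theta)
    rate_acc = [0.0] * N
    energy_acc = 0.0
    for _ in range(T):
        # Assumed channel model: exponentially distributed power gains.
        h = [rng.expovariate(1.0 / g) for g in mean_gain]
        C = [math.log2(1.0 + power * x / noise) for x in h]
        Q = [eff * power * x for x in h]
        metric = [theta[n] * C[n] - nu * Q[n] for n in range(N)]
        n_star = max(range(N), key=metric.__getitem__)
        rate_acc[n_star] += C[n_star]
        energy_acc += sum(Q) - Q[n_star]  # non-scheduled users harvest
    return [r / T for r in rate_acc], energy_acc / T


def dual_update(theta, nu, c_bar, q_sum, q_req, step_nu, step_theta):
    """Step 3: projected gradient update of nu and theta."""
    nu_next = max(nu + step_nu * (q_req - q_sum), 0.0)
    r = min(c_bar)  # assumption: r is the current minimum average rate
    theta_next = [min(max(t + step_theta * (r - c), 0.0), 1.0)
                  for t, c in zip(theta, c_bar)]
    total = sum(theta_next)
    if total > 0:  # assumption: renormalize so that sum(theta) = 1
        theta_next = [t / total for t in theta_next]
    else:
        theta_next = [1.0 / len(theta)] * len(theta)
    return theta_next, nu_next


def run(mean_gain, q_req, iters=30, T=2000, seed=0):
    """Repeat steps 1 to 3 for a fixed number of iterations."""
    rng = random.Random(seed)
    N = len(mean_gain)
    theta, nu = [1.0 / N] * N, 0.0
    c_bar, q_sum = None, None
    for _ in range(iters):
        c_bar, q_sum = simulate(theta, nu, mean_gain, T, rng)
        theta, nu = dual_update(theta, nu, c_bar, q_sum, q_req, 0.05, 0.05)
    return theta, nu, c_bar, q_sum


if __name__ == "__main__":
    print(run([1.0, 2.0], q_req=0.3))
```

A fixed iteration count is used here instead of a convergence test to keep the sketch short; in practice the loop would stop once the changes in $\nu$ and $\theta_n$ fall below a tolerance.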


Figure 2.4: Average sum rate versus average sum harvested energy of the ET schemes
for different numbers of UTs.

Next, we perform simulations for the proposed optimal ET scheduler. The simulation
parameters are assumed to be the same as in Sections 2.2 and 2.3. As the baseline
scheme, we use the order-based ET scheduler from [26]. For this scheme, the users'
instantaneous N-SNRs are sorted in ascending order, and then, among the set of users
whose N-SNR orders fall into a predefined set of allowed orders $\mathcal{S}_a$, the AP schedules
the one with the minimum moving average throughput. Therefore, the selection rule of
the order-based ET scheme is

$$
n^*(i) = \arg\min_{n:\ O_{U_n} \in \mathcal{S}_a} r_n(i - 1),
\tag{2.38}
$$

where $O_{U_n} \in \{1, \dots, N\}$ is defined as the order of the instantaneous N-SNR of user $n$, and
$r_n(i - 1)$ is the throughput of user $n$ averaged over previous time slots up to slot $i - 1$.
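The order-based baseline rule in (2.38) can be sketched in Python as below. The sketch assumes 1-based orders with order 1 assigned to the smallest instantaneous N-SNR, in line with the ascending sort described above; the function name and the example values are hypothetical.

```python
def order_based_et_select(nsnr, avg_throughput, allowed_orders):
    """Select the user with the minimum average throughput among the users
    whose N-SNR order lies in the set of allowed orders.

    nsnr[n]           instantaneous N-SNR of user n
    avg_throughput[n] throughput of user n averaged up to the previous slot
    allowed_orders    set of allowed orders (1 = smallest N-SNR)
    """
    ranked = sorted(range(len(nsnr)), key=lambda n: nsnr[n])  # ascending
    order = {n: k + 1 for k, n in enumerate(ranked)}
    candidates = [n for n in range(len(nsnr)) if order[n] in allowed_orders]
    if not candidates:
        raise ValueError("no user falls into the set of allowed orders")
    return min(candidates, key=lambda n: avg_throughput[n])


nsnr = [0.3, 2.0, 1.1]            # orders: user 0 -> 1, user 2 -> 2, user 1 -> 3
avg_throughput = [0.1, 0.9, 0.4]
print(order_based_et_select(nsnr, avg_throughput, {2, 3}))
print(order_based_et_select(nsnr, avg_throughput, {1, 2, 3}))
```

Restricting the allowed orders to the higher ranks trades fairness-driven selection against channel quality: user 0 has the lowest average throughput but is only eligible when order 1 is allowed.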

The R-E curves of the considered schemes are shown in Figure 2.4. The optimum
ET scheduler provides a substantial sum rate gain compared to the order-based ET
scheduler. In particular, the proposed optimal scheduler is able to exploit the degrees
of freedom across different time slots for improving the system performance. On the
other hand, the average sum harvested energy of the proposed scheme increases rapidly
with the number of receivers in the system, since more UTs participate in EH. Besides,
the average sum rate increases only slightly with the number of UTs compared to the MT
scheduler. In fact, the optimal ET scheme cannot fully exploit the multiuser diversity for
improving the average system sum rate due to the required fairness in terms of equal
average data rate for all UTs.

Chapter 3.
Optimum Online Multiuser Scheduling with Power Allocation


In this chapter, we enhance the MT scheduling scheme (cf. Section 2.2) by considering
joint user scheduling and power allocation. To this end, we adopt the problem
formulation in (2.2) and introduce an extra power allocation variable $P_n(i)$ for user $n$ in
time slot $i$ as an additional degree of freedom. Thus, we obtain the following problem
formulation:


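Problem 4. Joint Power and Scheduling Optimization:

$$
\begin{aligned}
\text{maximize} \quad & \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i)
 \log_2\!\Bigl( 1 + \frac{P_n(i)\, h_n(i)}{\sigma^2} \Bigr) \\
\text{subject to} \quad & \text{C1: } \sum_{n=1}^{N} q_n(i) = 1,\ \forall i, \\
& \text{C2: } q_n(i)\bigl(1 - q_n(i)\bigr) = 0,\ \forall n, i, \\
& \text{C3: } \sum_{n=1}^{N} P_n(i)\, q_n(i) \le P_{\max},\ \forall i, \\
& \text{C4: } \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} P_n(i)\, q_n(i) \le P_{\text{ave}}, \\
& \text{C5: } \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} \bigl(1 - q_n(i)\bigr) \cdots
 \Bigl( \sum_{k=1}^{N} P_k(i)\, q_k(i) \Bigr) \ge Q_{\text{req}}.
\end{aligned}
\tag{3.1}
$$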


Constraint C3 is a hardware constraint which limits the maximum instantaneous
transmit power to $P_{\max}$. Constraint C4 limits the average transmit power to the budget
$P_{\text{ave}}$. The term $\sum_{k=1}^{N} P_k(i) q_k(i)$ in C5 represents the total radiated power in time slot $i$.

The objective function in (3.1) is non-convex. In order to convexify the objective
function, we use the following change of variables: $P'_n(i) = P_n(i) q_n(i)$. Also, we relax
the binary constraint on the user selection variables $q_n(i)$ and rewrite (3.1) as



Now the objective function in (3.2) is concave, since $f(x) = \log(1 + x)$ is concave and
its perspective $f(x, y) = y \log(1 + x/y)$ is jointly concave with respect to $x$ and $y$ [37].
Nevertheless, constraint C6 is still non-convex due to the coupling of the optimization
variables. In the following, we adopt the big-M formulation to linearize the coupled terms
$\bigl(1 - q_n(i)\bigr) \bigl( \sum_{k=1}^{N} P'_k(i) \bigr)$. The new problem formulation is given by


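Problem 5. Problem Reformulation:

$$
\begin{aligned}
\text{maximize} \quad & \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} q_n(i)
 \log_2\!\Bigl( 1 + \frac{P'_n(i)\, h_n(i)}{q_n(i)\, \sigma^2} \Bigr) \\
\text{subject to} \quad & \text{C1-C2,} \\
& \text{C3: } \sum_{n=1}^{N} P'_n(i) \le P_{\max},\ \forall i, \\
& \text{C4: } \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} P'_n(i) \le P_{\text{ave}}, \\
& \text{C5: } \frac{1}{T} \sum_{i=1}^{T} \sum_{n=1}^{N} \bigl( \cdots \bigr) \ge Q_{\text{req}}, \\
& \text{C6: } P_n^{\text{virtual}}(i) \le \bigl(1 - q_n(i)\bigr) P_{\max},\ \forall n, i, \\
& \text{C7: } P_n^{\text{virtual}}(i) \le \sum_{k=1}^{N} P'_k(i),\ \forall n, i, \\
& \text{C8: } P_n^{\text{virtual}}(i) \ge 0,\ \forall n, i,
\end{aligned}
\tag{3.3}
$$

Figure 3.1: Average sum rate versus average sum harvested energy of the MT scheme
with joint power allocation and user selection for different numbers of UTs.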

where $P_n^{\text{virtual}}(i)$ are auxiliary variables for solving the problem. In fact,
$P_n^{\text{virtual}}(i)$ can be treated as a virtual transmit power variable which is controlled by
$q_n(i)$ in constraint C6 and by the actual transmit power $P'_k(i)$ in constraint C7. We note
that Problems (3.3) and (3.2) are equivalent when $q_n(i)$ is binary. In particular, both
problems share the same optimal solution. Besides, by following a similar approach as in
Appendix A.1, it can be shown that the binary relaxation on $q_n(i)$ is tight at the optimal
solution, i.e., $q_n^*(i) \in \{0, 1\}$. More importantly, Problem (3.3) is a convex optimization
problem which can be solved efficiently via standard numerical solvers designed for
convex programs.
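The role of the virtual power variable can be illustrated with a short Python sketch. It assumes the textbook big-M bounds $0 \le P^{\text{virtual}} \le (1 - q) M$ and $P^{\text{virtual}} \le \sum_k P'_k$, where $M$ is any constant that is at least as large as the largest possible total transmit power (for instance $P_{\max}$); this is an assumed form and not necessarily the exact constraint set of (3.3). The function names are hypothetical.

```python
def virtual_power_bound(q, total_power, big_m):
    """Largest virtual power satisfying the three linear big-M bounds:
    p <= (1 - q) * big_m, p <= total_power, p >= 0."""
    return max(0.0, min((1.0 - q) * big_m, total_power))


def coupled_term(q, total_power):
    """The bilinear term (1 - q) * total_power that big-M replaces."""
    return (1.0 - q) * total_power


total_power, big_m = 3.0, 10.0
for q in (0, 1, 0.5):
    print(q, virtual_power_bound(q, total_power, big_m), coupled_term(q, total_power))
```

For binary $q$ the largest feasible virtual power coincides with the coupled product, so the linear bounds reproduce the original term exactly. For a fractional $q$ the bound is looser than the product, which is why the tightness of the binary relaxation is needed for the equivalence.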


Simulation 

Next, we perform simulations to verify the performance of the proposed jointly optimal
power allocation and user scheduling algorithm. We adopt the same setup as in Chapter
2. The maximum average transmit power $P_{\text{ave}}$ is 10 W and the maximum instantaneous
transmit power $P_{\max}$ is set to 46 dBm. Figure 3.1 shows the R-E regions of the proposed
optimal resource allocation scheme with joint user selection and power allocation for
different power allocation schemes and different numbers of UTs. For comparison, we also
show the performance of the fixed power resource allocation scheme proposed in Chapter 2.
It can be observed that the average achievable rate and the average harvested energy
increase with the number of users. In fact, joint power allocation and user selection
can effectively exploit the channel fluctuations to improve the system performance.
Besides, a large gain in harvested power can be achieved by the proposed joint optimization
scheme over the fixed power allocation scheme, since the power allocation variables provide
more degrees of freedom across different time slots in resource allocation. Specifically,
compared to fixed power transmission, a larger power is transmitted to improve the
system performance when the channel condition is good. Conversely, a smaller transmit
power is allocated when the channel is in a deep fade, which preserves the energy of the
transmitter for exploiting good channel conditions in the future.

Chapter 4.
Conclusion


In this thesis, we have proposed optimal multiuser scheduling schemes for SWIPT systems 
considering different notions of fairness in resource allocation. The designed schemes 
enable the control of the tradeoff between the average sum rate and the average amount 
of sum harvested energy. Our results reveal that for the maximization of the system sum 
rate with or without fairness constraints, the optimal scheduling algorithm requires only 
causal instantaneous and statistical channel knowledge. Simulation results revealed that 
substantial performance gains can be achieved by the proposed optimization framework 
compared to existing suboptimal scheduling schemes. Besides, joint user selection and 
power allocation was also investigated for the considered SWIPT system. The obtained 
solution revealed that the proposed optimal resource allocation scheme improves the 
performance and enlarges the feasible R-E region compared to the scheme with fixed 
AP transmit power. Further investigation of the impact of imperfect CSI on resource
allocation is left for future work.

Appendix A.

Proof of Optimality of Binary Relaxation

A.1. Optimality of Binary Relaxation for Optimum Multiuser Scheduling Schemes


We prove that the optimal solution of the problem in (2.18) with the relaxed constraint,
$0 \le q_n(i) \le 1$, selects the boundary values of $q_n(i)$, i.e., 0 or 1. Therefore, the binary
relaxation does not change the solution of the problem.

If one of the $q_n(i)$ adopts a non-binary value in the optimal solution, there has to be
at least one other non-binary selection variable in the same time slot $i$, since the
selection variables in each time slot sum to one. We assume that the indices of the
non-binary selection variables are $n'$ and $n''$ in the $i$-th time slot. Then, for the
optimization problem corresponding to the MT scheduling scheme in (2.18), we obtain
$\alpha_n(i) = 0$, $\forall n$, from (2.12a) and $\beta_{n'}(i) = 0$ and $\beta_{n''}(i) = 0$ from (2.12b). By
substituting these values into (2.13), we obtain

$$
T \lambda(i) = \Lambda_{n'}(i),
\tag{A.1a}
$$

$$
T \lambda(i) = \Lambda_{n''}(i).
\tag{A.1b}
$$

From (A.1a) and (A.1b), it follows that $\Lambda_{n'}(i) = \Lambda_{n''}(i)$. However, due to the
randomness of the time-continuous channel gains, $\Pr\{\Lambda_{n'}(i) = \Lambda_{n''}(i)\} = 0$, where
$\Pr\{\cdot\}$ denotes probability. Therefore, the optimal $q_n(i) \in \{0, 1\}$, $\forall n, i$, and (2.18)
is the optimal selection policy.

The tightness of the binary constraint relaxation for the PF scheduling and the ET
scheduling can be proved by following a similar approach as for the MT scheduling.



